The Power of Confidence Intervals
نویسندگان
چکیده
We connect the power of Confidence Intervals in different Frequentist methods to their reliability. We show that in the case of a bounded parameter a biased method which near the boundary has large power in testing the parameter against larger alternatives and small power in testing the parameter against smaller alternatives is desirable. Considering the recently proposed methods with correct coverage, we show that the Maximum Likelihood Estimator method [1, 2] has optimal bias. It is well known that the most important property of Frequentist Confidence Intervals is coverage: a Confidence Interval belong to a set of intervals that cover the true value of the measured quantity with Frequentist probability . Neyman’s method obtains Confidence Intervals with correct coverage through the construction for each possible value of of an acceptance interval with probability for an estimator of . The union of all acceptance intervals in the – plane is called the Confidence Belt. The Confidence Interval for resulting from a measurement of the estimator is the set of all values of whose acceptance interval for include . Coverage is not the only property of Confidence Intervals, because many methods for the construction of a Confidence Belt with exact coverage are available (see Refs. [3, 4, 5, 1, 2]). These methods differ by power [6], a quantity which is obtained considering the construction of acceptance intervals as hypothesis testing. Coverage and power are connected, respectively, with the so-called Type I and Type II errors in testing a simple statistical hypothesis !#" against a simple alternative hypothesis !%$ (see Ref. [3], section 20.9): Type I error: Reject the null hypothesis !#" when it is true. The probability of a Type I error is called size of the test and it is usually denoted by . Type II error: Accept the null hypothesis !#" when the alternative hypothesis !&$ is true. The probability of a Type II error is usually denoted by ' . The power of a test is the probability (*)+ ,' to reject !#" if !%$ is true. A test is Most Powerful if its power is the largest one among all possible tests. This is clearly the best choice. Unfortunately, the power associated with a confidence belt is not easy to evaluate, because for each possible value -" of considered as a null hypothesis there is no simple alternative hypothesis that allows to calculate the probability ' of a Type II error. Instead, we have the alternative hypothesis !%$ : $/. )0 -" , which is composite. For each value of $/. )1 -" one can calculate the probability ' 2435 6 $7 of a Type II error associated with a given acceptance interval corresponding to " . A method that gives an acceptance region for -" which has the largest possible power ( 283 6 $9 :); < =' 283 6 $ is Most Powerful with respect to the alternative $ . Clearly, it would be desirable to find a Uniformly Most Powerful test, i.e. a test that gives an acceptance region for -" which has the largest possible power ( 243> 6 $ for any value of $ . Unfortunately, the Neyman-Pearson lemma implies that in general a Uniformly Most Powerful test does not exist if the alternative hypothesis is two-sided, i.e. both $@? " and $ A " are possible, and the derivative of the Likelihood with respect to is continuous in -" (see Ref. [3], section 20.18). Nevertheless, it is possible to find a Uniformly Most Powerful test if the class of tests is restricted
منابع مشابه
Confidence Intervals for the Power of Two-Sided Student’s t-test
For the power of two-sided hypothesis testing about the mean of a normal population, we derive a 100(1 − alpha)% confidence interval. Then by using a numerical method we will find a shortest confidence interval and consider some special cases.
متن کاملInference on Pr(X > Y ) Based on Record Values From the Power Hazard Rate Distribution
In this article, we consider the problem of estimating the stress-strength reliability $Pr (X > Y)$ based on upper record values when $X$ and $Y$ are two independent but not identically distributed random variables from the power hazard rate distribution with common scale parameter $k$. When the parameter $k$ is known, the maximum likelihood estimator (MLE), the approximate Bayes estimator and ...
متن کاملExact maximum coverage probabilities of confidence intervals with increasing bounds for Poisson distribution mean
A Poisson distribution is well used as a standard model for analyzing count data. So the Poisson distribution parameter estimation is widely applied in practice. Providing accurate confidence intervals for the discrete distribution parameters is very difficult. So far, many asymptotic confidence intervals for the mean of Poisson distribution is provided. It is known that the coverag...
متن کاملBootstrap confidence intervals of CNpk for type‑II generalized log‑logistic distribution
This paper deals with construction of confidence intervals for process capability index using bootstrap method (proposed by Chen and Pearn in Qual Reliab Eng Int 13(6):355–360, 1997) by applying simulation technique. It is assumed that the quality characteristic follows type-II generalized log-logistic distribution introduced by Rosaiah et al. in Int J Agric Stat Sci 4(2):283–292, (2008). Discu...
متن کاملConfidence Intervals for Lower Quantiles Based on Two-Sample Scheme
In this paper, a new two-sampling scheme is proposed to construct appropriate confidence intervals for the lower population quantiles. The confidence intervals are determined in the parametric and nonparametric set up and the optimality problem is discussed in each case. Finally, the proposed procedure is illustrated via a real data set.
متن کاملDistribution Free Confidence Intervals for Quantiles Based on Extreme Order Statistics in a Multi-Sampling Plan
Extended Abstract. Let Xi1 ,..., Xini ,i=1,2,3,....,k be independent random samples from distribution $F^{alpha_i}$، i=1,...,k, where F is an absolutely continuous distribution function and $alpha_i>0$ Also, suppose that these samples are independent. Let Mi,ni and M'i,ni respectively, denote the maximum and minimum of the ith sa...
متن کامل